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Abstract 

This letter studies distributed opportunistic channel access in a wireless network with decode-and- 
forward relays. All the sources use channel contention to get transmission opportunity. If a source wins 
the contention, the channel state information in the first-hop channel (from the source to its relay) is 
estimated, and a decision is made for the winner source to either give up the transmission opportunity 
and let all sources start a new contention, or transmit to the relay. Once the relay gets the traffic, it may 
have a sequence of probings of the second-hop channel (from the relay to the destination). After each 
probing, if the second-hop channel is good enough, the relay transmits to the destination and completes 
the transmission process of the source; otherwise, the relay decides either to give up and let all sources 
start a new contention, or to continue to probe the second-hop channel. The optimal decision strategies 
for the two hops are derived in this letter. The first-hop strategy is a pure-threshold strategy, i.e., when 
the first-hop channel signal-to-noise ratio (SNR) is more than a threshold, the winner source should 
transmit to the relay, and subsequently the second-hop strategy should let the relay keep probing the 
second-hop channel until a good enough second-hop channel is observed. Simulation results show that 
our scheme is beneficial when the second-hop channels have larger average SNR. 

Index Terms 

Decode-and-forward relaying, opportunistic transmission, optimal stopping, throughput maximiza¬ 
tion. 


I. Introduction 

In a distributed wireless network such as ad hoc network, normally the traffic sources use 
channel contention to obtain transmission opportunity. For example, if a source has traffic to 
send, it sends a request-to-send (RTS) to its destination, and if the RTS is successfully received, 
the destination sends back a clear-to-send (CTS), and then the source can transmit its traffic, 
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even if its channel to the destination is poor. To efficiently utilize the wireless channel, it may be 
better if a source could give up its transmission opportunity when its channel is not good, i.e., 
it does not transmit upon reception of CTS, and thus all sources immediately start a new round 
of channel contention. The challenge is: when should a channel be called “good channel”? This 
challenge can be addressed easily for a centralized network coordinated by a central controller. 
For example, in a cellular network, the base station (BS) is the central controller, and collects 
channel state information of all users for each channel. Then the BS can pick up the best user, 
which has the best channel gain, to utilize a channel, referred to as opportunistic channel access 
or channel-aware scheduling. However, in a distributed network, no such central controller exists 
to decide which user has the best channel gain. Rather, each user needs to decide based only on 
its local observation (i.e., its own channel gain) and channel statistics of other users’ channels. 
The challenge of opportunistic channel access in a distributed network was addressed in [1], A 
source first sends a probe (e.g., RTS) to its destination for channel contention. If the contention 
is successful, the destination estimates the channel signal-to-noise ratio (SNR) and feedbacks 
(e.g., by using CTS) to the source. If the channel SNR is less than a threshold value, which 
can be numerically calculated based on the statistics of users’ channels, then the source gives 
up its transmission opportunity; otherwise, the source transmits its traffic using the maximal 
achievable transmission rate of the probed channel. As follow-ups of [1], the works in [2], [3], 
[4], [5] investigate opportunistic channel access in a distributed wireless network when channel 
information is imperfect, when multiple transmissions can be successfully received, when there 
exists a delay constraint, and when a fine channel estimation could be available, respectively. For 
wireless relaying networks, distributed opportunistic channel access is investigated in [6] and [7] 
with decode-and-forward (DF) relays and amplify-and-forward (AF) relays, respectively. Next, 
the work in [6] is introduced since this letter is a follow-up of the work in [6]. A distributed 
DF relaying network is considered in [6], in which each source-destination pair is aided by a 
DF relay. If a source has traffic to send, it sends a probing packet, and based on reception of 
the probing packet, the information of channel SNR in the first hop (from the source to its relay 
and to the destination) is obtained. Then the source decides to 1) give up, or 2) transmit with 
direct link, or 3) continue to probe the second hop (from the relay to the destination). If the 
source decides to probe the second hop, then the channel SNR of the second hop is estimated, 
and it is decided either to give up or to transmit (by using direct link or relay link, whichever 


February 24, 2015 


DRAFT 



3 


has better utility). 

In [6], when the source decides to proceed to probe the second hop, if the second-hop channel 
SNR is not good, it is likely that the source will give up the transmission opportunity and all 
sources start a new contention. However, since the second-hop channel is time-varying, a natural 
question is: if a second-hop channel is poor, is it beneficial to let the relay wait until the second- 
hop channel becomes better? The rationale for this question is: if the source gives up when 
the second-hop channel is poor, then it still takes time for the system to have a successful 
channel contender later, and the successful channel contender may not have good channel SNRs 
in two hops. So it is possible to have benefits by using a relay-waiting scheme, i.e., letting the 
relay wait until the second-hop channel becomes better. This letter targets at an answer to the 
aforementioned question. In specific, we first derive an optimal strategy for the relay-waiting 
scheme. Then performance of the relay-waiting scheme and the scheme in [6] are compared 
by simulations, which demonstrate that the relay-waiting scheme can be beneficial when the 
second-hop channels have larger average SNR. 

II. System Model 

Consider a distributed DF relay network that includes a number, M, of source-destination 
pairs. Each source-destination pair has a relay assigned. First consider the case with direct links 
from sources to destinations. Similar to [6], to probe the first-hop channels, a source can send 
a probing packet, and if there is no collision, the probing packet is received by both its relay 
and its destination. The relay and the destination then can estimate the channel SNRs from the 
source to themselves. Then the relay reports its channel SNR information to the destination, and 
the destination makes decision for the first hop (give up or transmit). For this case, by reception 
of the reporting message from the relay, the destination can estimate the channel SNR from 
the relay to itself, and thus, the destination has complete channel SNR information for the two 
hops: from the source to the relay, from the source to itself, and from the relay to itself. Then 
the destination can calculate the achievable end-to-end transmission rate given as R between the 
source and itself. Therefore, although the communication from the source to the destination is 
with two hops, it can be treated as a virtual one-hop communication with achievable rate R. 
So the same method as that in [1] (which deals with single-hop ad hoc networks) can be used 
to find out the opportunistic channel access strategy. Therefore, in this letter, we investigate the 
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case without direct links between each source-destination pair. Assume channels in the first hop 
(i.e., from sources to their relays) follow independent and identically distributed (i.i.d.) Rayleigh 
fading with average received SNR being pj, while the channels in the second hop (i.e., from 
relays to destinations) follow i.i.d. Rayleigh fading with average received SNR being p g . 

The M sources use a channel contention procedure similar to those in [1] [6] [7], as follows. 
At a minislot (the duration of which is denoted as a), each source sends an RTS with probability 
p to its relay. So at each minislot, if no source transmits, i.e., the minislot is idle (the probability 
is (1 — p) M ), then all sources start a new channel contention in next minislot; if more than one 
source send RTS (the probability is 1 — (1 — p) M — Mp{ 1 — p) AI ~ l ), it means that transmissions 
of the sources collide with each other, and thus, all the sources start a new channel contention 
after a time-out value (the duration of which is equal to CTS duration) following the collision; if 
only one source sends RTS (with probability Mp(l — p) M ~ 1 ), then we call the source a winner 
source. Define an observation as the interval from the starting point of the channel contention 
until a winner source appears (i.e., its RTS is successfully received by its relay). The average 
duration of an observation can be calculated as r, = 1 -0-+ ' (trts + 

timeout ) + r RT s, in which trts and T Ume out are RTS and timeout durations, respectively. 

At the end of an observation (say, observation n), the winner source’s relay can estimate 
the channel SNR from the winner source to itself by the RTS reception, and it decides from 
two options: 1) option give-up : to give up the transmission opportunity, and notify the source 
of the decision by sending back a CTS. This CTS is also received by other sources. Thus, 
subsequently all sources can start a new contention. 2) option stop : to stop the process and 
utili z e the transmission opportunity, and send back a CTS to notify the decision. In the CTS, a 
transmission rate denoted R n is also indicated for transmission from the winner source to the 
relay. Then the winner source transmits for duration of a channel coherence time denoted as r d 
by using transmission rate R n . The optimal value of R n is derived in Section ITVl 

For observation n, if the winner source stops, denote reward Y n as the total amount of traffic 
that is sent by the winner source and received by its destination, and denote T n as the time 
duration from observation 1 until observation n plus the time used for transmissions in the two 
hops. Denote N as the stopping time, i.e., the winner sources in the first N — 1 observations do 
not stop, and the winner source in the Nth observation stops. This letter targets at an optimal 
stopping time denoted as N*, which makes the system achieve the maximal system throughput, 
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( 1 ) 


where E[-] means expectation. N* is also referred to as optimal stopping strategy. Based on 
[8, Chapter 6], we can transform problem (|T|) into a problem that maximizes reward Y N — XT N 
with A > 0. In specific, for A > 0, an optimal strategy denoted N*(X) should be found, which 
maximizes expected reward of the transformed problem: 


U( A) = sup {E[Y n{x) ] - AE[Tjv ( a)]}- 


( 2 ) 


M(A)>0 


Then if we find a A* such that U(X*) = 0, then an optimal strategy of problem O is in the 
form of N*(X) with A = A* [8]. 

Next we find optimal strategy for problem ©, which includes two parts: the optimal second- 
hop strategy and optimal first-hop strategy, discussed in the subsequent two sections. 


III. Strategy for the Second Hop 


Consider observation n. Here we first try to find the optimal strategy for the second hop, 
i.e., we assume the winner source stops and transmits to its relay with rate R n . For the second 
hop, the relay should find out its best strategy. The relay first sends an RTS to the destination, 
and the destination estimates the second-hop channel SNR denoted r g and feedbacks a CTS 
that includes the channel SNR information, referred to as a channel probing. If the achievable 
second-hop transmission rate, given as log 2 (l + r 9 ), is not less than R n , then the relay transmits 
to the destination by using transmission rate R n with duration r,/; otherwise, the relay may decide 
to give up or to continue channel probing. If the relay decides to give up, all sources start a 
new channel contention. If the relay decides to continue channel probing, then the relay waits 
for channel coherence time r,i and has a new RTS-CTS exchange with the destination (a new 
channel probing), and transmits if the achievable second-hop transmission rate is not less than 
R n , or decides to give up or to continue channel probing otherwise. This procedure is repeated 
until the relay either transmits or gives up. It can be seen that there are a sequence of decisions 
in the second hop, which makes the optimal second-hop strategy challenging. To address the 
challenge, we review second-hop strategies from a new perspective, as follows. 

Denote Si as the second-hop strategy that the relay can have up to l channel probings of its 
channel to the destination. So if the relay cannot find a second-hop channel realization with 
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achievable rate not less than R n within l channel probings, the relay is forced to give up. 
Denote V l (X ) (which is a function of A) as the net reward of strategy Si. Therefore, the optimal 
second-hop strategy should achieve net reward max{E[V’ 1 (A)],E[V 2 (A)],E[V°°(A)]}. 

The net reward expectation of strategy S i is 

EfV^A)] = Pr [r 1 > r n }(R n T d - Ar 2 ) + Pr [r 1 < r n ](-A(r i?T5 + t C ts )) 

( 3 ) 

= (1 — F g (r n ))(R n Td — A t 2 ) + F g (r n )(—X(T RTS + tcts )) 

where Pr[-] means probability of an event, t C ts is CTS transmission duration, r 2 = t RT s + 
Tcts + Fi is the time cost for probing and transmission in the second hop, F fl (•) is the cumulative 
distribution function (CDF) of the second-hop channel SNR (the subscript g stands for the second 
hop), Tg is the second-hop channel SNR in the first channel probing, r n = 2 rin — 1 is the minimum 
required SNR of the second hop for achievable transmission rate R n . 

The net reward expectation of strategy is 

E[C°°(A)] 

=Pr [r] > r n }(R n r d - Ar 2 ) + Pr[rJ < r n ](E[V' 00 (A)] - At 2 ) (4) 

=(1 - F g (r n ))(R n T d - Ar 2 ) + F 3 (r ri ,)(E[C 00 (A)] - Ar 2 ). 

From © and ©, we have 

E[V'°°(A)] - E[C X (A)] = F g (r n )(E[V°°(X)} - A r d ). (5) 

A. Case with E[C 0O (A)] > A r d 

If E[V°°(A)] > A r d , then E[C°°(A)] > E[C 1 (A)]. Now we compare E[C°°(A)] with E[C*(A)], 
l > 1. 

We have 

E[V‘(A)] = Pr [r] > r n ](R n T d - X r 2 ) + Pr [r] < r n , r 2 g > r n \(R n T d - 2Ar 2 ) + ... 

+ Pr [r] < r n ,..., r l ~ l < r n , r l g > r n \(R n r d - IXt 2 ) 

+ Pr [r] < r n , ..., r^ 1 < r n , r l g < r n \(-(l - 1)At 2 - A (t rts + t C ts )) (6) 
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in which r g , r g , ■■■,r l g are channel SNRs of 1st, 2nd, ..., Zth channel probing of the relay. E[V°°(A)] 
can be expressed as 


E[y°°(A)] = Pr [r] > r n ](R n r d - Ar 2 ) + Pr [r] < r n ,r g > r n ](R n T d - 2A r 2 ) + ... 

+ Pr [r] < r n ,r l ~ l < r n , r\ > r n ]{R n r d - l\r 2 ) 

+ Pr [r] < r n , < r n ,r l g < r ri ](E[l/°°(A)] - l\r 2 ). 


So 


E[1/°°(A)] - E[y z (A)] = Pr [r] < r n , ...,r'- 1 < r n ,r l g < r n ](E[y°°(A)] - A r d ) 

= (F 9 (r n )) z (E[y°°(A)] - A r d ) > 0 (7) 

which means the optimal second-hop strategy should be: the relay keeps probing the second-hop 
channel until the achievable rate is not less than R n . 


B. Case with E[1A°°(A)] < \r d 

If E[V°°(A)] < A r d , from © we have E[V°°(A)] < E[1/ 1 (A)]. Now we compare E[1/ 1 (A)] 
with E[^(A)], l > 1. 

K[V\X)}-E[V l (X)} 

= - (E[V°°(A)] - E[17 1 (A)]) + (E[V 0O (A)] - E[^'(A)]) 

= -(E|V~(A)]-Ara)-(-()'(E[V“>(A)] -At,) 

=F,(t„)( - 1 + (F,(r„))'- 1 )(EH'“(A)] - At,) > 0 
in which (a) comes from © and ©. and ( b ) comes from F g (r n ) < 1 and E[V°°(A)] < A T d . 
Thus, the optimal second-hop strategy should be: the relay probes the second-hop channel only 
once, and transmits if the achievable transmission rate is not less than R n , or gives up otherwise. 

Overall, for the second hop, depending on comparison of E(V°°(A)] with A r d , the relay should 
either probe the second-hop channel once, or keep probing the second-hop channel until the 
achievable second-hop rate is not less than R n . 
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IV. Strategy for the First Hop 


Based on optimal strategy in the second hop, now we derive optimal strategy for the first 
hop. In the first hop, at observation n, once the RTS of the winner source (i.e., the source that 
wins channel contention) is received by its relay, and the first-hop channel SNR denoted rf{n) 
is estimated, then the decision is either give-up or stop (i.e., to transmit), whichever has higher 
reward. If the decision for the first hop is give-up, then the net reward is —A t C ts (since a CTS 
is needed to notify the decision); if the decision for the first hop is to transmit with rate R n , 
the net reward is max {E[V 1 (A)], E[V 2 (A)],..., E[V°°(A)]} — \{tcts + 7d), in which tcts + Td 
is time cost in the first hop: the relay uses a CTS to notify the source of the decision and the 
source transmits with r d duration (noting that the time cost in the subsequent second hop is 
included in max{E[V 1 (A)],E[V 2 (A)],..., E[V°°(A)]}). 

First consider E[V°°(A)] < A r d for the second hop. Then based on discussion in Section ITlI-Bl 
max{E[V rl (A)], E[V 2 (A)],..., E[V°°(A)]} = E[V 1 (A)], so the net reward of transmission in first 
hop is E[V 1 (A)] — X{t C ts + T d)- Since E[V°°(A)] < Xr d , from © we have 

E[l/‘(A)] = (1 - F J (r„))E[V'”(A)] + F s (r n )\r d 

\o) 

< (1 - F g {r n ))XT d + F g (r n )XT d = Xr d 

which leads to E[V 1 (A)] — X(TcTS+T d ) < —Xt C ts- In other words, the net reward of transmission 
in the first hop is less than the net reward of give-up in the first hop, and thus, the winner source 
will always give up in the first hop. Therefore, when we calculate the net reward of transmission 
in the first hop, we can ignore “E[V°°(A)] < A r d \ Thus, we focus on E[V°°(A)] > A r d , and based 
on discussion in Section ITlI-Al we have max {E[V 1 (A)], E[V 2 (A)],..., E[V°°(A)]} = E[V°°(A)]. 
So the net reward of transmission (stopping) in the first hop is 

E[V°°(A)] — X{t C ts + r d) 

=R n T d — , — tAt 2 — A (t C ts + T d) (9) 

-L fg\ r n) 

(d) Iil 

= log 2 (l + r n )T d - Xtcts - X T d - Xe p s r 2 

in which (c) comes from E[V°°(A)] = R n T d — 1 _ F 1 , ^ At 2 which is from ©, and (d) is from 
F g (r n ) = 1 — e p n (Rayleigh fading) and r n = 2'" —1. The net reward © is not a monotonically 
increasing function of r n . So we need to set up an optimal r n that makes the net reward maximal. 
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Define function 4>(x) = log 2 (l + x)t<i — Xtcts ~ Xr d — Ae p sr 2 , which is a concave function. 
To find the optimal x, denoted x* , that maximizes cj)(x), we can solve = 0, which leads to 


Td 


A si 
= — e p s r 2 . 


( 10 ) 


(1 + x*) In 2 p g 

x* can be calculated from (fTOl) numerically. So r n should be set to x* if feasible. However, it 
may not be feasible to set r n to be x* since r n should be no more than the first-hop channel SNR 
rf(ri). Thus, overall we should set r n = min{r/(n), x*} and R n = log 2 (l + min{r/(n),x*}). 

Recall that an optimal stopping strategy of problem © with A* satisfying U(X*) = 0 is 
an optimal stopping strategy of problem ©. So next we focus on optimal stopping strategy of 
problem © with A*. Maximal expected reward U(X*) of problem © should satisfy an optimality 
equation [8]: 


min {rj! (n) ,x* } 

E[max { log^l + min {r f (n),x*})r d -X*( t C ts + r d + e ^ r 2 ), 

f/(A*) - AVcts}] - An = U(X*). 


Since U(X*) = 0, the optimal equation is rewritten as 

min{rj ( n),x* } 

E[ max { log 2 (1 + min {r f (n),x*})r d - A *(t C ts + r d + e ^ r 2 ), 

— X*Tcts}] — X*Ti (11) 


from which A* can be calculated numerically. 

Accordingly, the optimal stopping strategy in the first hop is given as 

min{r ^ ( n),x * } 

N*(X*) = min [n > 1 : log 2 (l + min{r/(n), x*})r d - A *(t C ts + r d + e ^ r 2 ) 

> — X*Tcts} ( 12 ) 

in which x* can be calculated from (fTOl) with A = A*. 

The left handside of the inequality in (fl2l) is a non-decreasing function of rj(n). Denote 

min{ry (n),cc* } 

ff as the solution of r^(n) for log 2 (l + min {rf(n), x*})r d — A *(t C ts + r d + e r 2 ) = 

—A *tcts- Then the optimal stopping strategy in the first hop is rewritten as N*(X*) = min {n > 
1 : rf(n) > f/}. Thus, at observation n, if the fist-hop channel SNR rj (n) is less than the 
threshold ff, the winner source gives up; otherwise, the winner source stops, i.e., transmits with 
rate R n = log 2 (l + min {r/(n), x*} ), and subsequently the relay keeps probing the second-hop 
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channel until an achievable rate not less than R n . The values of f/ and x* can be calculated 
offline, and thus, the optimal strategy is a pure-threshold strategy, with very low computational 
complexity. 


V. Performance Evaluation 

Computer simulations are carried out to evaluate our proposed optimal relay-waiting scheme. 
The simulated network has 18 source-destination pairs, with other parameters set as: a = 20//s, 
trts = 103^S ,r C TS = Ttimeout = 106/iS, T d = 0.8ms, p = 0.1, p f = 1. We vary the average 
second-hop SNR p g from 2 to 20. For each p g value, we first numerically calculate fy, and use 
the value of fy as the pure threshold in the simulations to obtain the system throughput. The 
simulation results are shown in Fig. HI Simulation results for the scheme in [6] (with direct links 
not considered) are also shown. It can be seen that, when the average second-hop SNR p g is 
below 5, the scheme in [6] achieves higher system throughput. But when p g > 5, our scheme 
achieves better throughput performance. Indeed, for a network with the second-hop channels 
having larger average SNR, if the current probed second-hop channel realization has a poor 
SNR, letting the relay wait may be more time-efficient compared to giving up. 



Fig. 1. System throughput of proposed scheme and the scheme in [6J. 
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VI. Conclusion 

In this letter, we exploited distributed opportunistic channel access in networks with DF relays, 
and we answered the question whether or not to let the relay wait for a better second-hop channel 
if the current second-hop channel is not good. For such relay-waiting, we derived the optimal 
strategies in the two hops. It was shown that the first-hop strategy is a pure-threshold strategy, 
while the second-hop strategy is to keep probing until a good enough channel is observed. Our 
simulations demonstrated that the optimal relay-waiting scheme should be adopted when the 
second-hop channels have larger average SNR. 
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